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- Abstract 

We discuss whether one should expect that multiply imaged QSOs can be understood 
with 'simple' lens models which contain a handful of parameters. Whereas for many lens 
systems such simple mass models yield a remarkably good description of the observed 
properties, there are some systems which are notoriously difficult to understand quanti- 
tatively We argue that at least in one case (B 1422+231) these difficulties are not due to 
a 'wrong' parametrization of the lens model, but that the discrepancy between observed 
and model-predicted flux ratios are due to substructure in the lens. Similar to microlens- 
^ ■ ing for optical fluxes, such substructure can distort also the radio flux ratios predicted 
by 'simple' mass models, in particular for highly magnified images, without appreciably 
^ ! changing image positions. Substructure also does not change the time delay significantly 
and therefore has little effect on the determination of the Hubble constant using time 
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delays. We quantify these statements with several simple scenarios for substructure, and 
propose a strategy to model lens systems in which substructure is suspected. 



O 1 1 Introduction 

CO 



Multiply imaged QSOs and radio (Einstein) rings provide the most accurate mass mea- 
sures of distant galaxies (Zwicky 1937; for specific examples, see e.g. Rix, Schneider & 
Bahcall 1992; Wallington, Kochanek & Narayan 1996) and promise to provide one of the 
most robust methods for measuring Hq (Refsdal 1964; for recent examples, see Falco et 
al. 1996; Courbin et al. 1997). 1 These successes and expectations are based on our ability 
to understand the lensing geometry in sufficient detail. It is truly remarkable that many 
multiple QSOs can in fact be modelled quite accurately with a simple elliptical deflection 
potential or an elliptical mass distribution. This has nourished the expectation that most 
lens systems are in fact due to fairly simple mass distributions (a well-known exception 
is the system MG2016+112; see Nair & Garrett 1997). 

When modelling a multiple QSO lens system, one can either include or disregard 
the flux ratios of the images. Given that the number of observational constraints in 
these systems is never much larger than the number of free parameters of the lens model 
(and often is the same), there is of course a strong motivation to make use of the flux 
ratio information. As pointed out by Chang & Refsdal (1979), this may be a dangerous 
undertaking, given that the sizes of the optical continuum emitting regions of QSOs are 
expected to be of the same order as the Einstein radius of a star in the lens galaxy, 
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so that the optical magnitudes may well be affected by gravitational microlensing (see 
Wambsganss 1990, and references therein), even if averaged over long periods of time. For 
example in the case of QSO2237+0305, the image positions can be fitted very accurately 
with a variety of lens models (e.g., Kent & Falco 1988, Rix et al. 1992, Wambsganss 
& Paczyhski 1994), but the observed optical flux of image D is smaller than predicted, 
whereas the radio flux ratios (Falco et al. 1996) are in much better accord with the lens 
models. Therefore, constraints derived from optical fluxes should be used with care only, 
whereas radio sources are expected to be extended much beyond the Einstein radius of a 
stellar mass object, so that their flux should be (largely) unaffected by microlensing and 
thus provide useful constraints for lens modelling. 

That argument would of course be weakened if lens galaxies contain mass clumps 
with an Einstein radius comparable to the size of the radio components - i.e., masses 
of the order of, or exceeding, globular clusters. In that case the same situation as 
microlensing in the optical would apply. But this 'milli-lensing' could cause observable 
image splittings (Wambsganss & Paczyhski 1992) which have not yet been detected, so 
that this possibility appears not very plausible. 

Nevertheless, as we shall argue in Sect. 2, the difficulty in modelling some lens sys- 
tems in detail, with radio flux ratios included, points towards the possibility of substruc- 
ture in lens galaxies (or somewhere else along the line-of-sight to the QSO). Two kinds 
of substructure are briefly considered in Sect. 3, and numerical experiments to determine 
the probability of appreciable flux-ratio changes are presented in Sect. 4. In Sect. 5, we 
offer a practical way of treating (radio) flux ratios in the presence of substructure. We 
summarize our results in Sect. 6. 

2 B1422+231: More than a challenge for lens modellers 

The quadruply imaged QSO 1422+231 at z s = 3.62 was discovered in the course of the 
JVAS survey (Patnaik et al. 1992). The four images have a maximum separation of 
l'/3, and the lens galaxy has been accurately located (Impey et al. 1996) and its redshift 
(zd = 0.34) has recently been measured (Kundic et al. 1997, Tonry 1997). The flux ratios 
of the images are different in the radio and optical bands; in the radio, they are A:B:C:D= 
0.98:1:0.52:0.02, whereas in the optical, image A is fainter than B, yielding A:B?a0.8:l, 
somewhat dependent on the optical filter (the radio flux ratios are nearly independent of 
radio frequency), whereas the B:C:D ratios are largely compatible with the radio. Given 
that the optical flux may be affected by microlensing and/or dust obscuration, the radio 
flux ratios should be used in modelling this system. 

Several serious attempts have been made to model the lens in this system (e.g., 
Hogg & Blandford 1993, Kormann, Schneider & Bartelmann 1994b, Keeton, Kochanek 
& Seljak 1997) - and they all failed! Whereas the image positions can be fitted very 
accurately, the radio flux ratios could not be obtained. Given that the parametrized lens 
models used by the different authors differ moderately, this failure is probably not due to 
a too restricted choice of the families of lens models. Rather it may be a generic difficulty 
for 'simple' (i.e., smooth) lens models, as can be seen as follows. 

The large flux ratio between each of the images A, B, C, and the image D ~ 50, leaves 
two possibilities: (i) image D is highly demagnified, or (ii) A, B, C are highly magnified. 
We can exclude option (i), since the separation of D from the center of the lens galaxy 
- 0'/3 - does not put it into a region of very high surface mass density as is necessary 
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to obtain a strong demagnification. This leaves option (ii) only. Three highly magnified 
images occur generically when a source is close to, and inside a cusp (see Chap. 6 of 
Schneider, Ehlers & Falco 1992). In that case, there exists a universal relation between 
the image fluxes (see, e.g., Schneider & Weiss 1992; Mao 1992), namely that the sum of 
the fluxes of the outer two images (A and C) should equal the flux of the middle image 
(B). This relation is grossly violated in 1422+231, although the VLBI structure (Patnaik 
& Porcas 1997) shows a strong elongation in the direction tangent to the direction to the 
lens galaxy thus supporting the cusp hypothesis. Hence, although this universal relation 
is strictly valid only asymptotically for very high magnification, or in other words, if 
the source is sufficiently close to the cusp (and the theory of the lens mapping near 
cusps developed so far does not provide a good handle of where this 'asymptotic regime 
begins'), this argument easily explains why all the published lens models yield flux ratios 
with (A+C)/B^1.3 (the observed value in the radio is 1.5). A strong violation of the 
cusp constraint can be obtained either by putting the source further away from the cusp 
- that would decrease the magnification, thus the magnification ratio to D, therefore 
running into the same problem as option (i) above - or to impose a small-scale structure 
on the lens model which can locally change the magnification of individual images. Here, 
small scale means a scale smaller than the separation between A, B, and C, i.e., smaller 
than ~ 0'/5. 

Although B 1422+231 is the system where problems in the modelling are most easily 
seen, it may not be unique. Keeton et al. (1997) and Falco, Lehar & Shapiro (1996) have 
modelled MG0414+0534, and in both attempts the resulting % 2 per degree of freedom 
is too large to view these models as satisfying. In this system, the components Al and 
A2 have a separation much smaller than to the distance to the other two components. 
This suggests that they lie close to a critical curve passing between them. In that 
case, one would expect the fluxes of Al and A2 has to be much larger than those of B 
and C, and that the flux ratio A1/A2 is of order unity. This expectation is not at all 
satisfied with the optical data, and so again one should not be surprised to find that 
'simple' models fail (for this case, microlensing by stellar mass objects offers a plausible 
explanation, see Witt, Mao & Schechter 1995). A similar situation occurs in PG1 115+080 
(Keeton & Kochanek 1996) where all the x 2 of the best fits comes from the flux ratio of 
A1/A2, which is significantly different from unity (Courbin et al. 1997), even though a 
much larger flux uncertainty is assumed for the individual images than the measurement 
uncertainty in order to account for microlensing. We consider these difficulties as a hint 
that substructure may be present in the lens galaxies. In the next two sections, we 
shall show how small amplitude 'perturbations' of the mass distribution in the lens are 
sufficient to lead to the observed discrepancies. 

3 Analytic estimates 

We shall consider two particular hypotheses for the kind of substructure that may be 
present in a lens galaxy like that in B 1422+231: globular clusters (or in lensing language: 
point masses of order 1O 6 M ), and small amplitude fluctuations. 

3.1 The 'globular cluster' picture 

A point mass of ~ 1O 6 M yields a deflection angle at its Einstein radius of about 1 
milli-arcsecond, and thus can cause multiple images with similar separation, provided 
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that source and lens are sufficiently well aligned. Since we are interested in small per- 
turbations of the magnification of a macroimage, the typical situation will be one where 
the separation of the point mass from the image is considerably larger than the Einstein 
radius, so that the deflection angle is also smaller, and the secondary image will be highly 
demagnified. 

Consider a macroimage with local surface mass density k,q and shear 70, with coordi- 
nates chosen such that only the 1-component of the shear caused by the macro-model is 
non-zero. Let a point mass be located at position (x cos 92, x simp) relative to the image, 
where x is measured in units of the Einstein radius of the point mass. The unperturbed 
and perturbed magnifications are then, respectively, 

1 = 1 m 

^ (l-*o) 2 -7o 2 ' " (l-^o) 2 -(7o + 5 7 i) 2 -57 2 2 ' U 

where (^71,(^72) = (cos2</?, sin2ip)/x 2 is the shear caused by the point mass lens. Let 
R = Ho/fjL be the ratio of the unperturbed to the perturbed magnification; its dependence 
on x and <p can be written as 



2 7oA*o cos 2ip I / 1-R \ 2 7o ^o cos 2v? 

x = -w y + V 1 + ^tockw ) x i-R • (2) 

where in the second step we assumed that |1 — R\ <C 1, since we are interested in small 
magnification perturbations. Thus, locations x of the point mass for constant flux per- 
turbation R trace an '00 '-like curve. The area of the region in which a point mass has 
to be located in order to cause a flux perturbation larger (smaller) than R for R > 1 
(R < 1) is 



A = 2 



7o 



1-R 



(3) 



as can be easily obtained from integrating (2) over ip. For the lens models applicable 
to B1422+231, the A image should have hq ~ 15, 70 ~ 0.5, and so the area in which 
a point mass lens has to be located to cause a flux perturbation of ±20% is A ~ 150. 
Thus, with a surface mass density ~ 0.01, one obtains a high probability to find one 
of these point-mass lenses inside the region where it causes an appreciable magnification 
change. 

3.2 Plane wave perturbations 

As a second 'model' for lens perturbations, we consider a plane density wave. For sim- 
plicity of the argument, we again assume that the shear caused by the macromodel has 
only a 1-component, and that the plane wave is modulated also in the 1-direction. Then, 
5k = <5 7 i, 572 = 0. The ratio R then becomes, up to first order in the perturbation 5k, 

R=1-5k . (4) 

1 - k - 70 

The denominator in (4) is one of the eigenvalues of the magnification matrix caused by 
the macromodel. For images near a critical curve, one of the eigenvalues becomes very 
small, so that R can deviate significantly from unity even for small 5k. For the A image 
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of B1422+231, one expects k + 70 ~ 0.9, so that for a 20% change in magnification one 
needs 5k ~ 0.01, a truly small fluctuation. 

4. Numerical simulations 

We model the primary lensing galaxy Gl as a singular isothermal ellipsoid (SIE) density 
distribution and take into account the effect of several nearby galaxies as an external 
shear (Kormann, Schneider & Bartelmann 1994b). The external shear may also include 
contributions from large scale structure and/or the deviation of the primary lens galaxy 
from the SIE model. We choose the coordinate system on the lens plane, x = (xi,x 2 ), 
centered on the primary lens galaxy, with the xi-axis along (decreasing) right ascension 
and the X2-axis along (increasing) declination (cf. Fig. 3). The corresponding positions in 
the source plane y = (2/1,2/2) have the same orientations. The dimensionless coordinates 
x and y are the observed angles normalized to 6>o, an angular scale (Einstein radius) which 
characterizes the strength of the lens potential. We express the surface density in units 
of the critical surface density, E cr . For an Einstein-De Sitter universe, 1" = 3.0/i _1 kpc 
at the lens plane, and the critical surface density is U cr = 3650/iMq/pc 2 , here h is the 
present-day Hubble constant in units of 100 km s _1 Mpc -1 and we take h = 0.5. 

If the minor axis of the lens is along the xi-axis, the SIE potential can be written as 

VT f 

ip g (x,(f)) = x Tp~ x [siv-tfi arcsin(/ sin0) +cos</> arcsinh (— cos0)] , (5) 

where / is the axis ratio, /' = yjl — f 2 , and we have used polar coordinates x = 
(x cos (f>, x sin 0) . The shear perturbation is given by 

2 

x 

^7(^0) = y7cos(20) , (6) 

if the shear direction is along the a^-axis. The total potential is the sum of the SIE 
potential and the shear perturbation, 

V>o = ^ e (x,(j>- g ) +if) 1 (x,<i)- S ) , (7) 

where we have now generalized to the case when the lens minor axis encloses an angle 
4> g , and the direction of shear an angle of <f> s with the a; 1 -axis. The lens equation is then 
simply: y = x — VV>(x). The Jacobian of the lens mapping is A = <9y/<9x, and the 
magnification of an image is given by = (det^l) -1 . We refer the readers to Kormann 
et al. (1994a,b) for more technical details. 

4.1 A macromodel 

Observational data is taken from Tables 1 and 2 in Impey et al. (1996). The 
total number of observational constraints is 12, namely we have 4x2 image positions 
and 4 fluxes. Our lens model is described by 8 parameters, i.e., three lens parameters, 
(4> g , /, #0)? two shear parameters, (7, <f) s ), two source coordinates, and the unlensed source 
flux So. Thus, the number of degrees of freedom is 4. Nearly identical procedures to 
those in Kormann et al. (1994b) are used to find the best lens model. A minor difference 
is that we have only included the flux ratio D:B in the x 2 measure, since we argue 
that substructures may have modified the flux ratios of the brighter images significantly. 
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There are a few models that can fit the data with acceptable % 2 (cf. Keeton et al. 1997). 
One of these is summarized in Table 1. The model reproduces the observed positions 
within la of the observational uncertainties. The relative magnifications are, however, 
unsatisfactory. The flux ratio A:B is 0.78, close to the observed optical flux ratio, but 
deviates significantly from the observed radio value 0.98. In addition, image D is too 
bright compared with the observed value. 



Table 1: Model Parameters 



#0 


/ 


<Pg 7 


4>s fJ>A 






fJ>D 


0.77 


0.78 


-1.01 0.20 


-1.04 7.01 


-9.01 


4.52 


-0.33 



4.2 The 'globular cluster' picture 

The presence of massive lenses with mass ~ 1O 6 M provides a perturbation poten- 
tial: 

5i/j = m S ^-\n[(x-x l ) 2 + (y - yi )\ m = — (8) 

on top of ipo, where 6q is again the Einstein radius, Di = 610/i -1 Mpc is the distance to 
the lens galaxy, and M = 5 x 1O 5 M is the mass of the lenses. For simplicity we assume 
that these lenses are uniformly distributed and have a surface density of in units of the 
critical surface density. Similar to microlensing by solar mass objects (Wambsganss 1990), 
we generate point lenses randomly on the lens plane with the required surface density. To 
quantify the deviation of magnifications from that of the unperturbed potential, we define 
a quantity r = (/jla + Hb + Hc)/(\ha\ + |a*.b| + |a*c|)- If the magnifications of image A, B 
and C strictly obey the prediction of a cusp singularity, r = 0. The input value resulting 
from the unperturbed potential ipo is r = 0.123. In Fig. 1, we show the probability 
distribution of r as obtained from Monte Carlo simulations. For = 0.005,0.01, the 
probability of having a deviation larger than the observed one is about 21% and 36%. 
Therefore, a surface density in 'globular clusters' k* 0.005 may be sufficient to explain 
the observed deviation in flux ratios. Nearly in all the cases, the change in the image 
positions are a few milli-arcsecond. Such small changes are undetectable in the optical. 
In any case, such small changes can probably be accommodated by a small adjustment 
in the macromodel. 

4.3 Plane wave perturbation 

For each image, we consider a plane-wave fluctuation centered on the image, de- 
scribed by 

2 Ks x2f> o 
Sip = — | cos(k 1 x 1 + k 2 6 x 2 + </>o)e , (9) 
k 

where k s is the amplitude of fluctuation, (ki, k 2 ) is the wave vector, <po is a phase constant, 
and the last exponential term localises the perturbation within a few a r . Notice that in 
the absence of the last term, the amplitude of surface density corresponding to Sip is k s . 
Since the parameter space is rather large, we limit our study to one illustrative case. We 
fix the magnitude of the wave- vector to be 27r/|k| = O'/Ol but with random orientations. 
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Fig. 1. Probability distribution for the quantity r = {ha + Mb + Mc)/(Ima| + \hb \ + |mc|) f° r a sin- 
gular isothermal ellipsoid with external shear and massive point lenses, r measures the deviation of 
magnifications from the prediction (r = 0) of a cusp singularity. Equal-mass point lenses are distributed 
uniformly with surface density ft* . The dashed vertical line indicates the input value for the unperturbed 
lens model. The solid vertical line shows the observed value of r = 0.2. Three curves are shown for 
k* = 0.0025, 0.005 and 0.01. The probability of having a deviation larger than the observed one is 11 
(21, 36)% for ft* = 0.0025 (0.005, 0.01), respectively. 

4>o is distributed uniformly between to 2-tt, and the amplitude of fluctuations is drawn 
from a Gaussian distribution with zero mean and standard deviation a K . We take a r = 
0'/03, but the results are insensitive to the choice of a r . In Fig. 2, we show the probability 
distribution of r for o K = 0.005,0.01,0.02. The probability of having deviations larger 
than the observed one is 1.5 (6.8, 13)% for a = 0.005 (0.01, 0.02), respectively. These 
values are in good agreement with the analytical estimates presented in Sect. 3. 

4.4 Size of fluctuation 

Let us estimate how small (in size) the smooth fluctuations have to be in order to 
explain the observed deviation in B1422+231. We shall consider a similar perturbation 
potential as that in Eq. (9), but without the localisation exponential term. For simplicity, 
we shall consider Fourier modes that are stationary in a square with a side length, L, 
centered on the primary lens galaxy, i.e., the perturbation potential is 

N 1 O 7 2 

^^iilm , 7T . t7T. , 7T . 17171 . , 7T /9 0N . . 

W=J2 E -rpcos^te- yJcosC^oa*-— ), tij = (? + f) (10) 

i,j=l l,m=0 V 

where / and m are either or 1 for cosine and sine waves, % and j specify the order of 
Fourier modes, iV controls the number of Fourier modes used, and /%; m are the surface 
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Fig. 2. Probability distribution for the quantity r = {ha + + McO/GmaI + \i i b\ + l^cl) f° r a singular 
isothermal ellipsoid with external shear and localised plane- wave fluctuations (cf. eq. [10]). The am- 
plitude of fluctuations is drawn from a Gaussian distribution with standard deviation a K . Three curves 
are shown with a K = 0.005, 0.01, 0.02. The probability of having deviations larger than the observed 
one is 1.5 (6.8, 13)% for a K = 0.005 (0.01, 0.02), respectively. The dashed and solid lines have the same 
meaning as those in Fig. 1. 

density of fluctuations for the mode specified by i, j, I, and m. Smaller and smaller scale 
fluctuations are used when N increases. We take the side length of the box to be 4". 
This size is chosen such that the periodic boundary condition does not influence the 
central region of the deflection potential. Notice that the number of extra parameters 
is 4iV 2 , therefore for N = 1, the number of degrees of freedom is already zero; for 
N > 2, the system has more free parameters than constraints. The best fit parameters 
are again found by minimizing a x 2 measure, which now utilizes all the flux ratios (cf. 
Kormann et al. 1994b). To avoid excessively large fluctuations, we include a term 5x 2 = 
m ( K ijZm/0.05) 2 . For the model presented in Table 1, we find x 2 = 82,12,6,2 
for N = 0,1, 2, 3, while for N > 4, x 2 has essentially dropped to zero. For N = 0,1, 2, 
most x 2 are from observed fluxes and positions. It is interesting that for N = 1,2, when 
the number of parameters is equal to or larger than the number of constraints, the x 2 
remains high. This shows that large scale fluctuations are ineffective in reducing the 
X 2 , again highlighting the need for small-scale structures. The fluctuation wavelengths 
corresponding to N = 4, 5 are 2" , l'/6. Therefore fluctuations with wavelength of <J 
l'/5 are sufficient to reproduce the observed flux ratios. The resulting surface density 
distribution for iV = 5 is shown in Fig. 4 as the solid contours. It is obvious that the 
contours close to the brighter images are somewhat twisted. In particular, the surface 
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density at the positions of image A and C are increased while that at the position of 
image B is decreased. The magnifications are 15.3, —15.6, 8.1 for A, B and C, respectively 
while that for image D is essentially unchanged (—0.32). However, notice in Fig. 3, the 
outer density contours are moderately distorted. Such distortions can be avoided by 
fluctuations with smaller wavelengths. Therefore smaller scale fluctuations are preferred. 




-Aa (") 

Fig. 3. The contours of constant surface density for a singular isothermal ellipsoid with external shear 
and plane wave fluctuations described by eq. (10), with N = 5 and L = 4". The contour levels are 
4, 2, 1, 0.5 and 0.25 of the critical surface density, respectively. The thin lines show the corresponding 
contours for the underlying singular isothermal ellipsoid. The four crosses show the image positions and 
the central dot shows the position of the primary lens galaxy, Gl. 

It should be noted that the model presented in Fig. 3 is not at all unique. The 'best 
fit model' depends certainly on the macromodel from which the ^-minimization was 
started, and even for a fixed starting model, there are many different models for which 
X 2 is nearly zero for A > 4. Therefore, the model in Fig. 3 should not be viewed as 
a realistic lens model, but the exercise should demonstrate the largest possible scale of 
substructure which can bring the flux ratios within the observed range. 

5 Practical modelling of lenses with substructure 

If we accept the view that the surface mass density of lens galaxies is not as smooth 
as described by 'simple' lens models, using flux ratios for modelling may yield misleading 
conclusions. On the other hand, discarding them altogether would result in a loss of 
information; in particular, as long as the perturbations are small, the magnification is 
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only weakly affected for images which are not highly magnified. Therefore, a statistical 
treatment may be appropriate. In this section we propose a simple method to perform 
lens modelling in the presence of small-scale structure in the lens. 

Let p^dul ;ko, I70I) be the probability density for a magnification given that the 
smooth model has surface mass density Ko and (complex) shear 70. If the unlensed flux 
of the source is So, the probability density for the magnified flux S = \fi\ S becomes 



Ps(S;k , |7o|) = -^JV 
>->o 



S_ 



;«o, l7o 



(11) 



Furthermore, let the observed flux be 5'( obs ) = xS, where the deviation of x from unity 
accounts for measurement uncertainties. Then, the likelihood to obtain a flux measure- 
ment <>( obs ) becomes 



Ps(S (obs) ;Ko,|7o|) = J ^Ps(S;K ,ho\)p. 



g(obs) 



(12) 



where p x (x) is the probability density of the 'measurement error' x. Therefore, for N 
images with observed positions 0^ obfe ' ) , positional uncertainty Oi, and observed fluxes 
g(obs) ^ ^ e likelihood function that can be maximized for obtaining model fits is 



i=l 



exp 



e r -e 



(obs) 



PS 



( o(° bs ). K . 



(13) 



where 6i are the predicted image positions, and Ki = Ko(Oi) and 7, = |7o(#i)| are the 
surface mass density and shear at these positions. 

For concreteness, we shall assume that the determinant D = det(A) = (1 — k — 
8k) 2 — I70 + ^7| 2 is distributed like a Gaussian with mean (D) and dispersion <jr>. If, for 
example, the perturbation quantities 8k and £7 were Gaussian, 



p{8k,8^) = 



1 



exp 



{8k) 2 W 



2a 2 



then D follows very closely a Gaussian distribution, with 



{D)=D + al-a: 



a 2 D = 4(1 - Kofal + 2 | 7o | z a 2 + a* + 2a\ 



(14) 



(15) 



where Do = (1 — kq) 2 — |7o| 2 is the determinant for the unperturbed lens. Since \x = 
1/D, the probability density for the magnification is = Pd(1/ 70 / 1 ^ \ and so the 

probability density for the absolute value of fx becomes 



Pm(H) 



1 



PD r-r + PD t-t 



-1 



(16) 



for notational simplicity we currently drop the arguments Ko and 70. Assuming a Gaus- 
sian distribution for x, with mean 1 and dispersion a x , we obtain from the foregoing 
expressions that 
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2tiSq<j D@x Q> \ 2 
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Cl + c 2 
2v^ 



Cl - c 2 
2v^ 



(17) 



and we introduced the abbreviations 



a 



ci = 



1 (S(° bs )/S ) 2 



2al 

g(obs) 
z 2 -b 



2al 



c 2 



DY 1 

+ 



2o* 



2** ' 



(18) 



r 2 ' 



(7(2) =ze z erfc(— 2) , 



where erfc(z) is the complimentary error function. This expression for ps is easy to 
calculate, e.g., by using the routine erf cc of Press et al. (1992). 2 In Fig. 4, we have plotted 
the function ps(S^ ohs ^) for several combinations of Ko and 70. As can be readily seen, the 
probability distribution broadens considerably for larger values of the magnification of the 
macromodel. Hence, the fluxes of weakly magnified images provide strong constraints, 
whereas highly magnified images yield weaker constraints on the model. 



10 



.a 
o 

"Hi 



0.1 



CO 



° 0.01 - 



0.001 



0.0001 




Fig. 4. The probability density PsiS^- ^), according to eq. (17) of the text. The parameters of the 
macromodel was chosen to be ko = 70, with ko = 0.1 (left-most curve), 0.2, 0.3, 0.4, 0.45, 0.48. The 
probability density for the perturbations 5k and £7 have been assumed to follow a Gaussian, with width 
o~k = o~ 7 = 0.02. Furthermore, the measurement errors were characterized by a Gaussian distribution in 
x with width a x = 0.02. It is easily seen that images with higher magnifications attain a much broader 
flux distribution than those where the magnification is of order unity. 



A FORTRAN77 routine which calculates ps(S^ ohs ^) can be obtained from the authors on request. 
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6 Discussion 



We have considered the possibility that the flux ratios in lens systems may be affected 
by small scale structure in the lens galaxy. Although we discussed only the SIE model, 
we have verified that it still seems difficult to fit the observed flux ratios using other 
power-law models of the deflection potential. A general analytic argument was presented 
why 'simple' lens models are unable to account for the radio flux ratios in B1422+231. 
We considered two potential pictures for substructure, namely 'globular clusters' with ~ 
1O 6 M and smooth perturbations, modelled as plane density waves. We have considered 
both pictures analytically, as well as through numerical simulations. Owing to the large 
magnifications of images A, B, C in 1422, only small perturbations (in mass) are needed 
to change magnifications, and thus flux ratios appreciably. 

The required perturbation is of the order of ~ 1% of the critical surface density, 
or about (40hM Q pc~ 2 ) in physical units. There are about 200 globular clusters in the 
Galaxy with a total mass of ~ 10 8 Mq. These globular clusters form a quasi-spherical 
distribution concentrated toward the galactic center. The average surface density of these 
objects within 5kpc is ~ lM p C - 2 . This is 2.5 x 10 4 /i 1 of the critical surface density, 
somewhat too low to produce a significant change in magnification (cf. Fig. 1). The 
'globular cluster' picture should apply equally well to the hypothetical massive black 
holes in galactic halos (e.g., Lacey & Ostriker 1985). They can produce much more 
visible deviations if just a few percent of the halo mass are in these objects. Our second 
model of substructure addresses smooth fluctuations. An obvious example of smooth 
fluctuations is the spiral arms observed in disk galaxies. Smooth inhomogeneities are 
also quite naturally expected in hierarchical structure formation models as a result of 
continuous merging and accretion of sub-clumps. 

An important question is whether the added perturbation affects the time delay 
significantly: we found from models like those shown in Figs. 2 and 3 that the perturbation 
changes the time delay at most by a few percent. For smaller-scale perturbations than 
the example shown in Fig. 3, the time delays will be affected even less. Therefore we 
conclude that substructure can have only minor effects on the time delay - gravitational 
lensing is still potentially a golden tool for determining the Hubble constant. 

We thank Matthias Bartelmann, Simon White and Hongsheng Zhao for helpful dis- 
cussions and comments on the paper. This work was supported by the "Sonderforschungs- 
bereich 375-95 fur Astro-Teilchenphysik" der Deutschen Forschungsgemeinschaft. 
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